Multi-Level Compositional Reasoning for Interactive Instruction Following

Authors

Abstract

Robotic agents performing domestic chores by natural language directives are required to master the complex job of navigating an environment and interacting with the objects in it. The tasks given to the agents are often composite and thus challenging, as completing them requires reasoning about multiple subtasks, e.g., bringing a cup of coffee. To address the challenge, we propose to divide and conquer it by breaking the task into multiple subgoals and attending to them individually for better navigation and interaction. We call it Multi-level Compositional Reasoning Agent (MCR-Agent). Specifically, we learn a three-level action policy. At the highest level, we infer a sequence of human-interpretable subgoals to be executed based on language instructions by a high-level policy composition controller. At the middle level, we discriminatively control the agent’s navigation by a master policy by alternating between a navigation policy and various independent interaction policies. Finally, at the lowest level, we infer manipulation actions with the corresponding object masks using the appropriate interaction policy. Our approach not only generates human-interpretable subgoals but also achieves a 2.03% absolute gain over comparable state-of-the-art methods in the efficiency metric (PLWSR in the unseen set) without using rule-based planning or a semantic spatial memory. The code is available at https://github.com/yonseivnl/mcr-agent.
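The three-level control flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: every class, function, plan, and action name below is a hypothetical stand-in, and the learned policies are replaced by fixed lookups so that only the division of responsibility between the levels is visible.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Subgoal:
    kind: str    # "navigate" or an interaction type such as "pickup"
    target: str  # object the subgoal refers to


@dataclass
class Action:
    name: str
    mask: Optional[str] = None  # placeholder for a predicted object mask


def high_level_policy(instruction: str) -> List[Subgoal]:
    """Highest level (toy): map an instruction to a subgoal sequence.

    A fixed lookup stands in for the learned policy composition controller.
    """
    toy_plans = {
        "bring a cup of coffee": [
            Subgoal("navigate", "cup"),
            Subgoal("pickup", "cup"),
            Subgoal("navigate", "coffeemachine"),
            Subgoal("put", "coffeemachine"),
        ]
    }
    return toy_plans.get(instruction.lower(), [])


def navigation_policy(subgoal: Subgoal) -> List[Action]:
    """Toy navigation policy: one movement step, no mask needed."""
    return [Action("MoveAhead")]


# Lowest level (toy): one independent interaction policy per interaction
# type, each returning a manipulation action with an object mask placeholder.
INTERACTION_POLICIES: Dict[str, Callable[[Subgoal], Action]] = {
    "pickup": lambda sg: Action("PickupObject", mask=f"mask:{sg.target}"),
    "put": lambda sg: Action("PutObject", mask=f"mask:{sg.target}"),
}


def master_policy(subgoal: Subgoal) -> List[Action]:
    """Middle level (toy): switch between navigation and interaction."""
    if subgoal.kind == "navigate":
        return navigation_policy(subgoal)
    return [INTERACTION_POLICIES[subgoal.kind](subgoal)]


def run_agent(instruction: str) -> List[Action]:
    """Execute the subgoals in order and collect the resulting actions."""
    actions: List[Action] = []
    for subgoal in high_level_policy(instruction):
        actions.extend(master_policy(subgoal))
    return actions


if __name__ == "__main__":
    for action in run_agent("bring a cup of coffee"):
        print(action)
```

In this sketch the subgoal list is the human-interpretable intermediate output, and the master policy never produces masks itself; it only decides which lower-level policy acts next.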

Similar resources

Alignment-Based Compositional Semantics for Instruction Following

This paper describes an alignment-based model for interpreting natural language instructions in context. We approach instruction following as a search over plans, scoring sequences of actions conditioned on structured observations of text and the environment. By explicitly modeling both the low-level compositional structure of individual actions and the high-level structure of full plans, we ar...

Compositional Reasoning for Multi-modal Logics

We provide decomposition and quotienting results for multimodal logic with respect to a composition operator, traditionally used for epistemic models, due to van Eijck et al. (Journal of Applied Non-Classical Logics 21(3–4):397–425, 2011), that involves sets of atomic propositions and valuation functions from Kripke models. While the composition operator was originally defined only for epistemic...

Multi-faceted evaluation of interactive remote instruction

Over the past five years, we have used our Interactive Remote Instruction (IRI) system to teach classes to various audiences. In this paper we evaluate IRI from three perspectives: technical performance, student perceptions, and instructional view. IRI scales well to multiple delivery sites and supports classes of 30 or more students, though some tools do not scale well. We compared perceptions...

Compositional Reasoning for Pointer Structures

Canonical trace model of Hoare and He supports a satisfactory theory of graph properties. We use it to define a technique for the general composition of properties that extends the parallel-by-merge of Unifying Theories of Programming, and apply that to unique decompositions. Applications are provided to the fundamental concepts of acyclicity, reachability and canonicity; and those are used, in...

Compositional Reasoning for Epistemic Logics

We provide decomposition and quotienting results for multi-modal logic with respect to a composition operator, traditionally used for epistemic models, due to van Eijck et al. (Journal of Applied Non-Classical Logics 21(3–4):397–425, 2011), that involves sets of atomic propositions and valuation functions from Kripke models. While the composition operator was originally defined only for epistemi...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2023

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v37i1.25094